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Expression vector*. 

St> There are described a number of plasmld vectors 
Me for the expression of genetic m-t»* L n *£ r £ 
!„ "levels In yeasts. The plaamlds each comprise a 
yeLt select va y maricer. a ye«t replication o^nand la 
2 promoter positioned relative to a unique restrlct- 
lonslte In such a way that expression wj"'***™ 
of a ?~lypepade coding sequence Inserted at _*e re- 
.tlctlSn V. The promoters used are derivedfrom *e 
S- region of a gene coding for a yeast gr/w^cen- 
d %SXL pho8pnog.yc««te 

ZZ realon of the yeast TRP1 gene. In one t******* ** A 
< mid conmina apromoTSTderlved from j>oth *e3 ^ 
realone of the PGK gene. The replication systems used 
10 S me yaaaT^%plioation origin or an ^nomoua 
CO replicating eequence (ARS) atabillaed with «*ASS «*- 
SbMg eequence (ASS). The replication systems altow 
W for a choice of high or low copy n^ber per cell, ne 
CO promoter sequencee allow for a choice of high or low 
2 e^eaelon level. A kit including vectors hav^a corn- 
S Wnation of these alternative features la des^. Yeast 
° expression vectors Including a gene for coding tor nu- 
© man Interferon-a are described. 
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EXPRESSION VECTORS' 

This invention relates to the field of molecular 
biology and in particular to plasmid vectors suitable for 
the expression, at various levels, of genetic material xn 
yeasts. 

Recently plasmids have been developed that can be 
used as replication vectors in yeast (Utruhl et al (1979) 
PNAS 76 1035 and Kingsman et al (1979) Gene 7 141). 

Yeast replication vectors are capable of autonomous 
replication within a yeast host organism and are therefore 
suitable for introducing foreign DNA into yeasts. 

The vectors have alsfc 
been used to isolate a portion of -yeast DNA for further 
analysis. Whilst such known systems are capable of re- 
liable replication within a yeast host organism they are 
not, to a significant extent, themselves capable of 

expression of inserted DNA. 

The production of useful and interesting polypeptides 
by the exploitation of recombinant DNA techniques has 
hitherto been centred around E. coli as a host/vector 
system (Martial e t al (1979) Science 205 602 and Nagata 
et al (1980) Nature 284 316). In general these express- 
ion systems have depended on a plasmid vector containing 
an E. coli promoter sequence, a ribosome binding site 
(Shine-Delgarno sequence) and often the first few codone. 
of an E.coli coding sequence to which the -foreign" cod- 
ing sequence is joined (Hallewell and Emtage (1980) Gene 
9 27). In many cases, therefore, fusion proteins are 
synthesised, although more recently procedures have been 
developed to allow synthesis of "foreign" proteins with- 
out attached E. coli amino acid sequences (Guarente e t al 

(1980) Cell. 20 543). 

In some situations E. coli may prove to be unsuitable 
as a host/vector system. For example E. coli contains a 
number of toxic pyrogenic factorsthat must be eliminated 
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from any potentially useful pharmaceutical product. The 
efficiency with which purification can be achieved wxll, 
of course, vary with the product. Also the proteolytic 
activities in E. -coli may seriously limit yields of some 

5 useful products (e.g. Itakura et al (1977) Scxence 198 
1056)". These and other oonsiderations have led to in- 
creased interest in alternative host/vector systems, xn 
particular the use of eukaryotic systems for the product- 
ion of eukaryotic products is appealing. Amongst the 

10 eukaryotic organisms suitable for exploitation perhaps 

the easiest to manage is the yeast .cnrrhnrnmyres cerevxsxae. 
Yeast is cheap, easy to grow in large quantities and xt 
has a highly developed genetic system. 

It is an object of this invention toprovidea yeast 

15 vector system capable of expressing an inserted polypep- 

tide coding sequence. 

According to the present invention we provide a 
. yeast expression vector comprising a yeast selectxve 
. marker, a yeast replication origin and a Veast promoter 
20 positioned relative to a unique restriction sxte xn such 
a way that expression may be obtained of a polypeptide 
coding sequence inserted at the restriction site. Pre- 
ferably the expression vector should include at least a 
portion of a bacterial plasmid. This enables the yeast 
25 expression vector to be manipulated in a bacterial host 

system (e.g. E. coli)- 

We have used two types of yeast replication origin 
and selective marker which are known to the art of yeast 
replication vector construction. The first is based on 
30 the replication region of the natural yeast plasmxd 2p 
3 - 2 micron). This plasmid is cryptic, that is 

no readily detectable phenotype and it is present xn about 
100 copies per cell. In a particular example a 3.25kb 
fragment from a 2 F plasmid derivative P 0DB219 (Beggs 
35 (1978) Nature 275 104) has been used. The fragment con- 
cerned comprises two EcoRI fragments (2.5kb and 0.75kb) 
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as follows: 

LEU2 2a 0RI 
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0.75kb 
EcoRI EcoRI 



2.5kb 



EcoRI 



5 (not to scale). - 

The LEU2 selective marker surrounds the internal 
EcoRI site and may be disrupted by cleavage at this sxte. 

sequences have been described in detail (Hartley 
and Donelson (1980) Nature 286 560) and the LEU2 region 

10 has also been the subject of study (Dobson et al C1981) 
Gene 16 133). The 3.25kb EcoRI fragment shown above has 
been ^sed in the expression vectors of the present invent- 
ion as a selection/replication module. Expression vectors 
of the present invention including the fragment may be 

. , . a i r\ %/oo«f with a copy number of about 5u- 
15 stably maintained in yeast wun a CU H/ 

100 plasmids per cell. 

The second type of yeast replication origin and marker 
sequence depends upon autonomous replicating sequences 
(ARS) derived from yeast chromosomal DNA. The best char- 
20 a^rised of these sequences is 1.45kbp EcoRI 

which contains both the yeast TRPJ. gene and an ARS. C^L) 
(Kingsman et al (1979) Gene 7 141 and Struhl et al (1979) 
P.N.A.S. 76 1035). This fragment has been inserted xnto 
PBR322 (abacterial vector) to give the plasmid known as 
25 YRp7, which is capable of replication in both E. colx and 
yaast host systems. The ARS-based plasmids are extremely 
unstable, being lost almost entirely in the absence of 
selection and being maintained at only about 50* in the 
presence of selection, unless a second sequence, an ARS. 
30 stabilising sequence (or ASS) is covalently joined to the 
ARS sequence. It now seems likely that an ASS is a centro- 
~ic DNA .sequence (Clarke and J.Carbon (1980) Nature 287 
504). A useful fragment is the 1.45kb TRM. : 
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ARS EcoRI fragment modified to contain, 
ment which contains an "ASS: 
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627. Sau3a frag- * 



TRP1 coding sequence 




EcoRI 



BolII 



EcoRI 



ARS* 



ASS 
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20 



(not to scale). 

The EcoRI fragment shown ia-sdistaly '^^^ 
us ed in t„a s„pcsssion vectors of . pros n t « v-tx. ^ 
a selection/pepXioation module- E * preSS1 ° n stably 
p„so„t invention containing thl. ^ ^..id 

stained in yeast „itH a copy - » 
per cell. They segregate in an oroer 

pr ovid.d o yeast expression ~ ^.'^V. 

_ „ 4_ l.-ct a portion or tne p teryj. 
moter comprises at least a p vea st 
9 ao. coding for s yeast glycolytic enzyme 

he. ohosphoglucose isomerase, p»<«> 
olvcolytic enzyme may oejpnospuuy „«,__ 
giycoiyt nrinlase triose phosphate isomerase, 

phofructo kinase, aldolase,. trios p p o - ase 
glyceraldehyde 3 phosphate dehydrogenase, enclose 

pyr uvste kinase, >^^aT egression vector 

Especially preferred is ye at least a portion of 
herein the yeast proper ^ (££ K> 

;rs 0 t n e°xLession vectoJwhich include at least a 
gene . Yeast expressi glycolytic enzyme are 

portion of the 5« region of a yeast 9 > J 

to expression control by varying the level o 
susceptible to expres nut rient medium of a yeast 

fermentable carbon source in the fer mentable 
tran3 formed with - ^ ' f „ Referred aspect of 

carbon source is glucose, xn vector 
ca on ^ . . : _ we orov ide a yeast expression vector 
the present invention we provio , 
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wherein at least a portion of the *• 'region .of the 
gene is located upstream of the unique restriction site 
and at least a portion of the 3' region of the PGK gene 
is located- downstream of the unique restriction site. 
The terms "upstream" and "downstream"; relate to the dir- 
ection of transcription and translation." 

In an alternative aspect of the invention we provide 
a yeast expression vector wherein the yeast promoter com- 
prises at least a portion. of the *• region of the TRPJ. 
gene. 

The expression vectors of the present invention in- 
clude a yeast replication origin and a yeast selective 
marker. In a preferred embodiment these may comprise a 
fragment containing at least a portion of the yeast plasmxd 
2u replication origin, and at least a portion of the LEU2 
yeast selective marker. In an alternative preferred em- 
bodiment these may comprise a fragment containing at least 
a portion of an autonomous replicating sequence and at 
least a portion of an autonomous replicating sequence 

stabilising sequence. 

A gene inserted into a yeast expression vector of the 
present invention may be expressed as a fusion protein in 
the correct reading frame depending upon the vector chosen. 

In a preferred embodiment of the present invention we 
provide a yeast expression vector containing at least a 
portion of a gene coding for a polypeptide, preferably human 

interferon-oC . 

According to another aspect of the presdnt invention 
we provide a process for the production of e polypeptide 
comprising expressing the said polypeptide in a yeast host 
.organism transformed by a yeast expression vector contain- 
ing a gene coding for the said polypeptide. 

According to another aspect of the invention we pro- 
vide a kit of yeast expression vectors.. The kit may com- 
prise two or more yeast expression vectors of the present 
invention. The object of providing such a kit is to 
facilitate the molecular biologist's routine expression 
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work by affording him a variety of vectors having either 
high or low copy number per cell and either high or low 
levels of expression. The reading frame of inserted DNA . 
may also be selectable by choice of an appropriate vector 
from the kit. In a preferred embodiment we provide a kit 
comprising four or more yeast expression vectors wherein 
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each vector has either of the TRPT V Mil V ASS or LEU2 : 
2u replication origin selective marker and replication 
systems and at least a portion of either of the JRP1 or 
PGK 5* region yeast promoters. 

The present invention is now described with reference 
to the following Examples and to the accompanying draw- 

ings in which s 

Figure 1 is a partial restriction endonuclease map 

of two yeast replication vectors used as precursors in 
the construction of expression vectors of the present 
invention. They are designated pMA3 and pMA?1; 

Figure 2 is a partial restriction endonuclease map 
of overlapping EcoRI fragments of the TRfJ. gene; 

Figure 3 is a schematic diagram showing the con- 
struction of yeast expression vector pMA103; 

Figure 4 is a partial restriction endonuclease map 
of yeast expression vector pMA103; 

Figure 5 is a partial restriction endonuclease map 
of yeast expression vector pMA113; 

Figure 6 is a nucleotide sequence showing the 
sequence of the TRP1 5* control region; 

Figure 7 is a schematic diagram showing the con- 
struction of yeast- expression plasmid pMA36; 

Figure 8a) is a partial restriction endonuclease map 
of the plasmid P MA3-PGK showing the location of the yeast 
PGK gene; 

Figure 8b) is a map of the 2.95kb Hind III fragment 
of oMA3-PGK; 

Figure 9a) is an amino sequence showing the sequence 
ofvresidues 270-400 of yeast PGK ; 

Figure 9b) is a partial endonuclease map of the 

1.95kb Hind III fragment; 

(Figures 9a and 9b) allow -a- comparison of the:P-GK 
amino acid sequence andi restriction sites) ; 

Figure 10 shows the results of a SI RNA protection 
of the Hae III fragment spanning the 5' end of the PGK 
. coding sequence; 
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20 



25 



Figure 11 is a partial restriction endonuclease map 
showing the general structure of the P HA22a deletion 

series; . 

Figure 12 is a nucleotide sequence showing the 

sequence of the 5* region of the PGK gene; 

Figure 13 is a nucleotide sequence showing the 
sequence of the PGK gene from -226 to + 624 with various 
deletion end points marked; 

Figure 14 is a nucleotide sequence showing the 
sequence at the 3' end of the PGK from EcofiJ site to 
nucleotide 140 beyond the stop codon; 

Figure 1 5 is a schematic diagram showing the con- 
struction of yeast expression plasmid P MA3013; 

Figure 16 is a nucleotide sequence showing the 
sequence of a modified BamHI human interferon-^ gene 

fragment; - 

Figure 17 is a partial restriction endonuclease map 

showing a generalised interferon yeast expression plasmid; 

Figure 18 is a reproduction of a Coomassie stained 
SDS-PAGE gel showing (marked with an arrow) the production 
of a interferon fusion protein, produced by pMA230; 

Figure 19 is a graph showing the glucose regulation 

of interferon expression. 

In the drawings restriction endonuclease maps are 
not drawn to scale. The restriction sites are" in some 
cases abbreviated as follows: 



RI = 

Psfc or. P = 
Bam or 

30 Ba = 

Bg = 
Pv = 
Sal or S= 
Ha 3 - 
35 H3 = 
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The yeast expression vectors to be described are 
based on the bacterial plasmid pBR322 and use one or other 
of the yeast replication origin/selective marker modules 
described above. Both modules are EcoRI fragments and are 
therefore readily manipulated. 

We have constructed, using standard techniques, a 
vector designated pMA3 which is composed of the E. coli , 
vector pBR322 and the EcoRI fragment containing part of 
the 2u yeast plasmid as described above. This plasmid in 
0 contrast to many known chimaeric yeast plasmids appears to 
be relatively stable and is maintained in yeast at a high 
copy number of about 50-100 plasmids per cell. 

We have constructed, again using standard techniques, 
a second vector designated P MA91 which is composed of the 
5 E. coli vector pBR322 and the ARS: ASS EcoRI fragment 
described above. This plasmid is again stable in yeast 
but is present at a copy number of 1. 

The two vectors P MA3 and pMA?1 are described by 
partial maps in Fig. 1. They are not vectors falling 
>0 within the ambit of the present invention but rather 

important precursors in the production of vectors of this 
invention. In each case in Fig. 1 the thick line indicates 
the sequence derived from yeast DNA. 

pMA3 and pMA91 DNAs were prepared by standard proced- 
25 ures (Chinault and Carbon (1979) Gene 5 111). P MA3 was 
partially digested with EcoRI and the products separated 
on a 155 agarose gel. The 3.25kb double EcoRI fragment 
containing the 2 H origin of replication and the LEU2 gene 
was purified by the method of Tabak and Flavell (1978) 
30 Nucleic Acids Res. 5 2321). Similarly P MA91 was digested 
to completion with EcoRI and the 1.0kb fragment containing 
the TRP1 gene, ARS1 and an ASS was purified. Two DNA. 
" fragments were therefore available as replication/selection 
system modules. These are referral .to hereinafter as the 
35 Zu,:LEU2 module and the TRP1 : ARS1 : ASS module respectively. 

In the specific embodiment of the invention to be 
described the expression vectors contain one of two types 
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of useful functional promoter sequence. The first comes 
from the 5' region^ of the yeast TRP1 gene and the second 
fro. the 5' region of the yeast PGK gene. In some of 
vectors the 3' region of the yeast PGK gene has been 

included. tpd-i 
The 1.45kb EcoRI fragment containing the yeast TRPJ. 

gene and the ARS1_ has been completely sequenced 
(TSchumper and Carbon (1980) Gene 10 157). The organxsat- 
ion of the fragment is shown in Fig. 2 in which the 
shaded area to the left of the JRP1 *°«"9 sequence xs 
the -5- region of the gene. The -5' control region has 
considerable homology with the analogous regions of the 
iso-1-cytochrome C and GPO genes from yeast (Smith et_al 
(1979) Cell 16 759) Holland. & Holland (1979) JBC 254 
5466). In each case there is a region containing a purxne 
rich strand of about 30 nucleotides which terminates 48- 
76 nucleotides up-stream from the initiation codon. There 
is also a CACACA sequence 10-15 nucleotides up-stream from 
the initiation codon. This hexanucleotide has been seen 
only in yeast" and its proximity to the initiation codon 
n,ay implicate it in translation, possibly ribosome bind- 
ing, although the existence of ribosome binding sxtes 
other than the 5' CAP-structure in eukaryotes seems xn 
doubt (Naksishima et al (1980) Nature 156 226; Stiles £ 
al (1981) Cell 25 277). That signals necessary for TRPJ. 
Expression ate within the 5' flanking region on the 
1.45kb fragment in plasmid YR P 7 (Fig. 2) is certain since 
the gene is expressed with the fragment in both orxent- 
ations in pBR322. However, it is likely that all the 
signals for maximal JRP1 expression are not present 
since there are only 103 nucleotides 5' to the initiating 
- ATG and most eukaryotic genes possess 5' control regions 
considerably longer than this. A 95bp EcoRI-AluI frag- 
ment at the very left end of the 1 .45kb EcoRI fragment 
Oas shown in Fig. 2) should contain signals suffxcxent 
for TRP1 expression since the Alul site is only 8 nucleo- 
tideTTw'ay from the initiating AJG. This fragment there- 
fore provides a potentially useful "mobile promoter" 
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although additional sequences up-stream from this fragment 
may be necessary for maximal expression. The level of , 
expression from the promoter is .expected to be relatively 
low since TRM mRNA is present in about 0.1-0.01S of total 
mRNA. 

The second available yeast promoter sequence is that 
of the .phosphoglycerate kinase (PGK) gene isolated origin- 
ally by Hitzeman etal (1979), ICN-UCLA .SYMP . 14 57). The 
cloned PGK gene is less well characterised than TRPJ. but 
is potentially more useful for higher levels of expression 
in yeast as the single structural PGK gene produces 1-53 
of total polyA-mRNA.and protein. The glycolytic enzyme, 
genes of yeast are regulated by carbon source (Maitra and 
Lobo (1981) JBC 246 475) giving the potential of develop- 
ing a simple control system for the production of heter- 
ologous proteins in yeast. Analysis of protein and nucleic 
acid sequences have enabled us to define the co-ordinates 
of the PGK coding sequence. 

in summary two plasmids, high and low copy number, 
and two promoter sequences, high and low expression, are 
available for use in a yeast expression system. 
It is one aim of the invention to provide a set of vectors 
suitable for the expression, at various levels , of "useful" 
genes in yeast so that expression characteristics for a 
given heterologous protein can be determined quite simply 
by selecting the appropriate plasmid. 

This set comprises all four pairwise combinations of 
the two promoters, TRP1 and PGK and the TRP1 :-ARSj: ASS and 
LEU0.;2u replication origin', selective marker and replication 
systems. In addition the kit contains molecules based on 
the PGK expression system which will permit fusion of useful 
polypeptides to the amino-terminal amino acids of yeast phos- 
phoglycerate kinase in all three codoci reading frames. In 
PGK based expression systems expression can be regulated by 
the availability of glucose. The kit will, therefore, cover 
all possible expression, selection and replication require- 
ments so that any polypeptide coding sequence, complete or 
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partial, can be expressed under almost any control 
condition . 

Table 1 lists the designations of the plasmids in 
the kit and lists their basic properties. ; y 
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TABLE 1 



Saccharomvces Cerevisiae Expression Kit 



Plasmid 
class/number 



E.coli Selection 
& Replication 
System 



Yeast 
& Repli 



ection Expression 
tion System 



pMA 


103 


* 

Ampicillin R 


• 

LEU2:2^t 


TRP1 






r\ r\ n t n 

PBR322 






pMA 


113 


tt 


TRP1 : ARS1 


:ASS TRP1 


pMA 




» 


LEU2:2n 

* 


TRP1 
(extended) 


IpMA 


200? 


n 


ii 


PGK 


IpMA 


200f1 


it 


n 


PGK 


IpMA 


200f2 


if 


n 


PGK 


IpMA 


200f3 


it 


n 


PGK 


IpMA 


250p 


ii 


TRP1:ARS1 


: ASS PGK 


IpMA 


250f1 


it 


» 


PGK 


IpMA 


250f2 


it 


it 


PGK 


IpMA 


250f3 


n 


ii 


PGK 



p 
f1 

f2 
f3 



vector expresses by transcription promotion 

vector produces fusion protein with junction 
between codons 

vector produces fusion protein with junction 
at PGK reading frame +1 

vector produces fusion protein with junction 
at PGK reading frame +2 
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EXAMPLE 1 

A number of yeast expression vectors based on the 5' 
region of the yeast TRP1 gene were constructed. The 
scheme for the construction of yeast expression plasmid 
designated P HA103 is shown in Figures 3a) 3b). Partial 
restriction endonuclease site maps and sequence informat-. 
ion are shown; detailed information is in Tschumper and 
Carbon (1980) Gene 10 157 Hartley and Donelson (1980) 
Nature 286 860 and Sutcliffe (1979) C.S.H.S.Q.B. 43 79). - 
The use of T4 ligase and Bam HI linkers is according to 
Maniatis etal (1978) Cell 15 687 and restriction frag- 
ment purification from polyacrylamide gels was by the 
method of Maxam and Gilbert (1980) Methods in Enz. 65 499. 
E. cdli transformation was as described in Cameron et al 
P.N.A.S. (1975) 72 3416. The Alul site which defines one 
terminus of the EcoRI-AluI fragment at the 5' end of the 
TRP1 gene is located only 8 nucleotides up-stream from the 
ATG initiation codon. Therefore any sequence inserted at 
this Alul site should be efficiently transcribed from the 
TRP1 promoter. If the sequence also contains an ATG initi- 
ation codon close to the 5* end we would also expect 
efficient translation. Therefore the EcoRI-AluI fragment 
(93bp) was purified from other restriction fragments pro- 
duced by an EcoRI arid Alul. digest of YRp7 after fraction- 
ation on a 75S acrylamide gel. This fragment was then li- 
gated to pBR322 cleaved with EcoRI and BamHI linkers, more 
ligase and spermidine were then added to the reaction. 
After incubation for 6h at 2Q°C the DNA was 'phenol extract- 
ed ethanol precipitated and then digested with BamHI to 
cleave the linker. The BamHI was then removed by phenol 
extraction and the mixture of molecules Iigated and used 
to traasform E coli AKEC28. ( AKEC 28 = K.12 tr P C1117 
leuB6 Thy hsdr-hsdnT ) . Transformant colonies containing 
plasmid which had the small EcoRI-BamHI fragment of P BR322 
replaced by the 93bp EcoRI-AluI fragment from YRp7 with a 
BamHI linker attached to the Alul terminus were identified 
on the basis of their tetracycline sensitivity, their 
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positive signal in a "Grunstein and Hogness" hybridisation 
( (1975) P.N.A.S. 72 3961) with the 1 .A5kb TRP1 :ARS1 frag- 
ment as probe and subsequently by a detailed restriction 
analysis of their plasmid DNA. The plasraid thus formed is 
pMA101 Figure 3b). pMA101 was then cleaved at its unique 
EcoRI site, mixed with the 2m: LEU2 replication/selection 
module, ligated and used to transfrom E. coli AKEC 28 
selecting for ampicillin resistance and leucine prototrophy 
All transf ormants of this phenotype contained molecules 
with the same map as that shown as pMA103 Figure 4 or with 
the 2jLi.:LEU2 module in the other orientation. The express- 
ion site in pMA103 is BamHI and it transforms yeast at a 

frequency of 10 5 /Mg* 

Similarly the TRP1 : ARS1 : ASS module was inserted into 
the EcoRI site of pMA101 to construct pMA113 but in this 
case selection was for ampicillin resistance and trypto- 
phan prototrophy. A partial map of pMA113 is shown in 
Figure .5. The yeast transformation frequency is 10 /^g 
with pMA113. 

EXAMPLE 2 

A region of the yeast genome beyond the bounds of 
the 1.45kb EcoRI TRP1 fragment was cloned in order to make 
use of the. entire TRP1 5' control region. 

DNA sequences beyond the limits of the 1.45kb EcoRI ; 
TRP1 fragment are required for maximal expression from the 
TRP1 promoter. We isolated the Hind III fragment that 
overlaps the 1.45kb EcoRI fragment and which contains the 
entire TRP1 5 1 control region (shown as the shaded area 
Figure 2). In order to find the size of that Hind III 
fragment we used the smaller of the EcoRI - Hind III frag- 
ments from the 1.45kb EcoRI fragment (Figure 2) as a probe 

m 

in a Southern hybridisation to total yeast DNA cleaved 
with Hind III. "A single, approximately 2.0kb band was 
visible after autoradiography. Hind III digested total 
yeast DNA was then distributed in a 125 agarose gel and all 
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the DNA in the size range 1.5-2.5kb was purified by the 

method of Tabak & Flavell (1978) NAR 5 2321) and ligated 

with Hind III digested pTR262 (Roberts et al (1980) Gene 

12 123). 700 Tetracycline resistant colonies were then 

screened by the "Grunstein-Hogness" . procedure using the 

purified 1.45kb EcoRI ;TRP1 fragment as a probe. A single 

colony showed hybridisation with this probe and plasmid 

DNA was prepared from this clone. The -plasmid contained 

a 2.2kb Hind III fragment which hybridised specifically 

to the smaller of the EcoRI-Hind III fragments from the- • 

1.45kb EcoRI TRP1 fragment. The nucleotide sequence of 
. 

the -region up-stream from the EcoRI site at position - 103 
(A in ATG is +1) was determined by standard M13/dideoxy 
sequencing procedures (Sanger ^t al (1977) P*N*A.S. 74^ 
6463) and is shown in Figure 6. In this Figure the nu- 
cleotide sequence from 169 to 275 was after Tschumper and 
Carbon (1980) Gene 10. 157. Potentially important features 
are underlined.. New sequence data includes all sequences 
not overlined. In order to construct a derivative of - 
pMA103 that contains the entire TRP1 5 f control region a 
set of constructions was performed as outlined in Figure 7» 
(In this Figure the thick lines indicate DNA derived from 
yeast)* The 2.2kb Hind III fragment was purified by the 
method of Tabak & Flavell (1978) NAR 1 2321) and inserted 
into the Hind III site of pBR322 to form plasmid pMA33. 
The small EcoRI fragment from pMA33 was purified and then 
inserted into the unique EcoRI site of pMA101 (see Figure 
3(b)). The orientation of the fragment was checked to 
ensure reconstitution of the TRP1 5* region. The resulting 
plasmid is designated pMA35. pMA35 was then cleaved part- 
• ially with EcoRI and the 2^* LEU2 module inserted. Recom- 
binant molecules were screened for the presence of the 
2yLi LEU2 fragment at the pBR322 E coRI site rather than the 
EcoRI site at -103- Such a molecule is pMA36 (Figure 7). 
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EXAMPLE 3 



A number of yeast expression vectors based on the 5 1 
region of the yeast PGK gene were constructed. 

The yeast PGK gene exists on a 2»95kb Hind III frag- 
ment in the yeast- E. coli vector, pMA3, (Figure 1). A 

5 partial restriction map of this molecule is shown in 

Figure 8(a). The PGK Hind III fragment was isolated from 
a Hind III fragment collection inserted into % 762 (Murray 
jet al (1977) Molec. gen. Genet 150 53) using a 32 P labelled 
cDNA prepared from yeast poly-A RNA. The fragment is 

10 identical to the "3»1kb" fragment described by Hitzeman 
e t al (1980) JBC. 255 , 12073 in plasmid pB1 and in hybrid 
selection translation experiments (Ricciardi est al^ (1979) 
P.N.A.S. 7£ 4927) the fragment was shown to encode a pro- 
tein of identical mobility to pure PGK in SDS-PAGE. A 

15 restriction map of the 1.95kb fragment is shown in Figure 
8(b). 

The amino acid sequence of residues 270-400 of yeast 
PGK is shown in Figure 9(a). The sequence was determined 
by manual and automated Edman degradation. The amino acid 

20 sequence data' allowed us to match restriction sites on 
• the 2.95kb Hind III fragment with groups of two or three 
amino acids in the protein sequence. Figure 9(b) shows 
the relevant restriction sites and those sites are marked 
on the amino acid sequence in Figure 9(a)* The positions 

25 of the four sites on the restriction map and the protein 
sequence are congruent allowing us to orientate the gene 
with respect to the sites on the 1.95 kb Hind III fragment. 
Given that the molecular weight of PGK is 40Kd (415 amino 
. acid residues) and assuming that there are no large introns 

30 we can also predict the positions of the 5* and 3' ends of 
the coding sequence. The extent of the coding sequence, 
assuming colinearity, is shown in Figure 8(b), the initi- 
ation codon is about 900 nucleotides to the left of the 
EcoRI site and the termination codon about 300 nucleotides 

35 to the right. 



I 
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The position of the -5' end of the PGK transcript was 
located by the S1 protection method (Berk and Sharp (1978) 
P.N.A.S. 75 1274). The 1.2kb Hae Til fragment spanning 
the 5 f end of the coding sequence (Figure 8(b) was puri- 

5 fied from an agarose gel and hybridised to total yeast RNA. 
The hybrids were treated with various concentrations of S1 
nuclease and the products were analysed on a 1.5R agarose 
gel by Southern hybridisation using the" 1»95kb Hind III 
fragment as probe. Figure 10 shows that the size of the 

10 single protected fragment was 680bp. In this Figure the 
concentrations of S1 in each lane are as follows a) 25 
units b) 50 units c) 100 units. Lane d) has the 1.2kb 
Hae III fragment untreated. On the basis of our previous 
mapping data this would place the 5 1 end of the PGK trans- 

15 cript about 960bp to the left of the EcoRI site on the 
2.95kb Hind III fragment. This agrees well with our 
estimate of the position of the initiation codon and sug- 
gests that if there are any introns between the 5 1 end of 
the transcript and the Bql II site then they are very small. 

20 The 5* "control" region of the PGK gene is in a 

region that contains very few convenient restriction sites, 
making the design of a sequencing strategy relatively 
difficult. We adopted a procedure to solve this problem 
that may be of general use. Plasmid pMA3-PGK was digested 

25 with Sal I (Figure 8) and then with exonuclease BAL 31 to 
• remove about 500bp from each end. This resulted in the 
loss of the two small Sal I fragments and the creation of 
a series of deletions starting at the leftmost Sal I site 
in the PGK sequence and the Sal I site in pBR322 and end- 

30 .ing around the initiation codon in PGK and nucleotide 1150 
in pBR322 respectively. These deleted molecules were then 
ligated in the presence of a 50-fold molar excess of Bam 
HI linkers and then used to transform AKEC28 to. LEU + , 
Amp R . The general structure of these molecules, designated 

35 the pMA22a deletion series is shown in Figure 11. Seventy 
of these deleted molecules have been analysed by measuring 
the length of the EcoRI - B aa HI fragment containing the 
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5' region of the PGK gene. While they show a mean length 
of 1.5kb they have a spread of 500 nucleotides. : This 
collection therefore provides a number of molecules that 
are useful for the sequence analysis of the 5' region of 
5 the PGK gene. Two such deletions, C and W are shown in 
Figure 8(b). The small EcoRI - Bam HI fragments from 
these molecules were purified and cloned in M13mp7Q1 and 
sequenced by the dideoxy-chain termination method Sanger 
ejt al (1977) P.N.A.S. 74 5463, starting in each case at 
10 the Bam HI site and elongating towards the EcoRI site* 

The nucleotide sequence of 226 nucleotides up-stream from 
the inition codon and the first seven codons are shown in 
Figure 12. (In this Figure the box marks the approximate 
position of the 5* end of the transcript). The sequence 
15 was confirmed by sequencing four other deletions with 
overlapping end-points (data not shown). 

The pMA22a deletion series constitutes a collection 
of molecules amongst which are many potential PGK based 
expression vectors. Each with a different sized small 
20 EcoRI-Bam HI fragment and therefore each with a different 
"amount" of the JPGK. 5 1 region. They all have unique Bam 
. a HI sites at which genes may be inserted and expressed. 
Figure 13 shows the sequence of the PGK gene from -226 to 
+624 with the positions of various deletion end-points 
25 marked. The deletion end point numbers (Figure 13) are 

carried through to the name of the plasmid that bears that 
deletion e.g. plasmid pMA279 is a pMA22a deletion with the 
deletion end-points between the codons for amino acids 32 
and 33. At that position the Bam HI linker of sequence 
30 CCGGATCCGG has been inserted. At each of the deletion end- 
. points there is the same BAM HI linker with the exception 
of pMA301 which has the Bql II linker CAAAAGATCTTTTG 
inserted at position -1 . This Bcyl II linker was used in 
order to increase the A content of the region around the 
35 initiating ATG. 

Clearly plasmids pMA278 and pMA301 will produce 
transcriptional fusions with any coding sequence inserted 
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at their expression sites and are therefore of the pMA200p 
type in Table 1, whereas all the others will produce both 
transcriptional and translational fusions (i.e. fusion 
proteins will be made). pMA230 is a +1 (reading frame) 
fusion vector, pMA283 is an in frame (+3) fusion vector. 
The molecules are of the pMA2Q0Tt, f2 and f3 type in 
Table 1. 
EXAMPLE 4 

We have constructed a PGK based expression vector de- 
signated pMA3013 which comprises both 5» and 3« regions 

from the yeast PGK gene. 

We have determined the nucleotide sequence of the 3* 
region of PGK by standard procedures and this is shown in 
Figure 14. Figure 15 shows the scheme for constructing 
pMA3.013- Plasmid pMA3-PGK was cut with Bol II and f st I 
and the fragment containing the 3* end of the PGK gene 
(shown as a wavy line in Figure 15) was purified by the 
method of Tabak and Flavell (1978) NAR 5 2321).. This frag- 
ment was then ligated with Bql II and Pst_I cleaved pMA301 
and the mix was used to transform E.coli strain AKEC28 to 
ampicillin resistance and leucine prototrophy. Resulting 
clones were screened for a plasmid with three Hind III 
sites. Such a plasmid is pMA3013. P MA3013 has a unique 
Bql II expression site flanked by the PGK 5'. and 3* reg- 
ions. 



EXAMPLE 5 

The various yeast expression vectors described have 
been tested using a human interferon-©; as a heterologous, 
potentially useful coding sequence. The sequence is con- 
tained qn a B am HI fragment that is a derivative of plas=- 
mid rv'5H8 originally constructed by Prof. D.C. Burke, 
University of Warwick. Our modification places a Bam HI 
site followed by an AT G at a position corresponding to 



amino ac 



id S15 in the interferon signal sequence. The 
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nucleotide sequence of this B am HI fragment is given in 
Figure 16. The Bam HI fragment can be used in transcript- 
ion fusion constructions because it has its own trans- 
lation initiation codon and it can also be used in vectors 

5 designated to produce fusion proteins. This fragment was 
inserted into the expression sites of a variety of mole- 
cules the general structure of which is shown in Figure 17. 
The resulting molecules were then introducted into yeast 
strain MD40-4C (MD40-4C =&< ura2 t£ft1 leu2-3 leu2-112 

10 his3-1 1 his3-15 )by standard transformation procedures 

(Hinnen et al (1978) P.N.A.S. 75 1919) and the levels of 
interferon produced in yeast were measured using bovine 
EBTr) cells in a viral RNA reduction assay with Semliki 
Forest virus (SFV) as the challenge (Atherton & Burke, 

15 (1975) J . Gen. Virol 29 197). Table 2 shows levels of 

interferon produced in yeast cells containing various re- 
combinant molecules. 
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TABLE 2 



Interferon Expression from Various Vectors 



Expression 
Vector 



5' Region 



3 1 Region Molecules of a 

Interferon per 
cell* 



pMA103 

pMA 36 

pMA278 
pMA301 

pMA3013 

pMA23Q 

pMA3 (control) 



TRP1 

TRP1 (extended) 

PGK ( A 278) 
PGK (^ 301) 

PGK (^301). 

PGK ( ^ 230) 



PGK 



600 

1.7 x 10 4 

2.0 x 10 4 
1.5 x 10 7 

1.0 x 10 7 
1.5 x 10 7 

<50 (not 

detectable) 



Q 

* These figures assume 2 x 10 units of inter feron/mg. 



t 
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It can be seen that there is a considerable range of 
expression capabilities in the system depending on which 
expression vector is used. The highest levels are obtain- 
ed with the fusion protein vector pMA230 and the transcrip- 
tion vectors pMA301 and pMA3013 in which as much as 2% of 
the total cell protein is present as inteferon protein 
(Figure 18). This Figure shows Coomassie stained SDS-PAGE 
protein profiles in which the lanes contain 

(a) Total protein from MD40-4c containing pMA230 

(b) Total protein from MD4Q-4c containing pMA230/ 
interferon 

(c) Protein from MD40-4c containing pMA230/inter- 
feron after partial purification on an NK2 
column. The position of molecular weight 
markers are shown. An arrow marks the position 
of the PGK -interferon fusion protein. 

All interferon producing plasmids are maintained 
stably for at least 40 generations as measured by the 
proportion of cells in the population with the phenotype 
conferred by the expressing plasmid. 

EXAMPLE 6 

p GK in yeast is "induced" by glucose, therefore it 
was of interest to determine whether the structures nec- 
essary for the recognition of this regulatory system, are 
present on the 1500 nucleotide PGK fragment in for example 
pftA230 and if so whether human interf eron-cX. expression 
could be regulated by glucose. 

Yeast strain MD40-4c containing pMA230 with the 
inter feron-o< sequence inserted at the- Bam HI site was 
grown in rich medium with acetate as carbon source for 
twelve generations to a density of 2 x 10 6 cells/ml. 
These cells were used as inocula for two flasks of fresh 
medium. One containing glucose as carbon source and the 
other acetate. A second batch of cells grown on glucose 
was used to inoculate a fresh glucose culture. Therefore 
there were three inoculum/culture conditions: acetate/ 



1 
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acetate; acetate/glucose; glucose/glucose. Aliquots of 
these cultures were taken at various intervals, extracts 
were prepared and interferon levels we're assayed. The 
results of these assays are given in Figure 19 in which 

5 © = glucose/glucosej o = acetate/acetate and A = acetate/ 
glucose. The data in Figure 19 show that the glucose/ 
glucose culture contains relatively high interferon levels 
while the acetate/acetate culture has low levels over the 
course of the experiment.. The acetate/glucose culture 

10 exhibits increasing levels of interferon after the cells 
are transferred to glucose medium (time 13, Figure 190. 
This induction of interferon occurs over a period of. about 
8 hrs. and the levels of interferon produced by cells grown 
on glucose* are 20-30 -fold higher than in cells grown on acetate* 

15 While these results strongly suggest that carbon 

source control of interferon levels is being mediated by 
the 5* control region of the PGK gene it is important to 
establish that there is no difference in plasmid stability 
in cells grown on acetate or glucose. Therefore total DMA 

20 was prepared from aliquots of yeast cells taken at various 
points during the experiment described in Figure 19. The 
DNA was digested with EcoRI and fragments were separated 

on a i-% agarose gel. The fractionated bands were then 

• 32 
blotted onto nitrocellulose and hybridised with P-YRp7. 

25 The pBR322 component of this probe served to measure levels 
of plasmid in the yeast DNA preparations while the sequence 
of the 1.45kb fragment were used to establish a control 
for amounts of DMA, transfer efficiencies and hybrid!- 
sation efficiencies. In addition to this Southern blot 
30 analysis the proportion of L§u cells in the aliquots was 
measured by comparing colony counts on media with and 
without leucine. In all cases Southern hybridisation pro- 
files v/ere identical and > 99% of cells were Leu + (data 
not shown) showing that growth on acetate or glucose has 
no effect- on plasmid copy number or stability. 
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CLAIMS: 



1 . A yeast expression vector comprising a yeast 
selective marker, a yeast replication origin and a yeast 
promoter positioned relative to a unique restriction site 
in such a way that expression may be obtained of a poly- 
peptide coding sequence inserted at the restriction site* 

2. A yeast expression vector according to claim 1 
wherein the yeast promoter comprises at least a portion 
of the 5' region of a ger>e coding for a yeast glycolytic 
enzyme. 

3* A yeast expression vector according to claim 2 

* 

wherein the yeast promoter comprises at least a portion 
of the 5' region of the yeast PGK gene* 
4. A yeast expression vector according to claim 3 

wherein at least a portion of the 5 1 region of the PGK 
gene is located up-stream of the unique restriction site 
and at least a portion of the 3' region of the PGK gene 
is located downstream of the unique restriction site* 
5* A yeast expression vector according to any one of 

the preceding claims expression control of which is 
exercised by varying the level of a fermentable carbon 
source in a nutrient medium of a yeast transformed there- 
with. 

6. A yeast expression vector according to claim 5 

r 

wherein the fermentable carbon source is glucose* 

7. A yeast expression vector according to claim 1 
wherein the yeast promoter comprises at least a portion 
of the 5 1 region of the TRP1 , gene* 

8. A yeast expression vector according to any one of 
the preceding claims wherein the yeast expression vector 
contains at least a portion of the yeast plasmid 2/j 
replication origin and at least a portion of the LEU2 
yeast selective marker. 

9. A yeast expression vector according to any of 
claims 1 to 7 wherein the yeast expression vector con- 
tains at least a portion of an autonomous replicating 



- 25 - 



0073635 



i 



sequence and at least a. portion of an autonomous 
replicating sequence stabilising sequence. 

10. A yeast expression vector according to any one of 
the preceding claims wherein the yeast expression vector 

5 contains at least a portion of a gene coding for a poly- 
peptide. 

11. A yeast expression vector according to any one of 
the preceding claims wherein the yeast" expression vector 
contains at least a portion of a gene coding for human 

10 interferon-^, 

12. A method for the production of a polypeptide 
comprising expressing the said polypeptide in a yeast 
host organism transformed by a yeast expression vector 
according to any of the preceding claims containing a 

15 gene coding for the said polypeptide. 

13. A yeast transformed by a yeast expression vector 
according to any of the preceding claims. 

14. . Saccharomyces c'erevisiae transformed by a yeast 
expression vector according to any of claims 1 to 11. 

20 15. A kit of yeast expression vectors comprising two 
or more yeast expression vectors according to any of 
claim 1 to 9. 

16. A kit of yeast expression vectors comprising four 
or more yeast expression vectors wherein each vector has 
25 either of the TRP1 : ARS1 : ASS or LEU2 : 2y replication 
origin selective marker and replication system and at 
least a portion of either of the TRP1 or PGK 5 f region 
yeast promoter. 
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TRPI FRAGMENT (1 -45 Kb) 

.Alul Alul Alul(2) Alul . 

' ' " 1 

RI 5 » CODING SEQUENCE 3 « EcoRI 

TRPI 5* REGION 

-20 -10 
I I I 

: AAGCACACAAAG GCAG CT TG GA GT ATG — 

Rp t f 

(0) Alul TRPI 

(93) Initiation 



EcoRI 



EcoRI 



Alul 
_J 



Pst 



Bam 



93 bp 



EcoRI 
Bam HI 




T4 LIGASE 



Bam LINKERS 
Bam DIGEST 

T4 LIGASE 



FIG. 3(a) 
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TRANSFORM ECOLI 
PICK AMP R TET S 



1 



EcoRI 



Bam HI 




TRPI 5'REGION 



EcoRI EcoRI EcoRI 
L-iHORJ I j 



LEU 2 
3-25 Kb FRAGMENT 



TA LIGASE 



p MA 103 



FIG. 3(b) 
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EXPRESSION SITE (Bam HI) 



-AAGCACACAAAGGC AG CCGGAICC 

Bam HI 

(93) 

RI 



RI TRP I 5' Region 
Bam(Expression 
Sal sit€ > 




FIG. 4 



Pst 




TRPI 5'Region 
B am (Expression 
Sal site ) 



ORKpBR 322) 



FIG. 5 
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CATA CTATAT ATAT AATATA GAAGCATTTA ATAGAACAGC AXC6TAATAT AT6TGTA6TX 



60 



TGCAGTTATG ACGCCAGATG GCAGTAGTGG AAGATATTCT TTTATTGAAA AATAGCTGTC 



120 



V 



ACCTTACGTA CAATCTGAXC CGGAGCTTTT CTTTTTTTGC CGAXTAAjGAA TTCGGTCGAA 

Tjcogl 



180 



AAAAGAAAAG GAGAGGG CCA AGAGGGAGGG CATTGGTGAC TATTGAGCAC GTGAGTATAC 



240 



GTGATTAAGC ACACAAAGGC AGCTTGGAGT ATG 



FIG.6. 
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FIG. 9 



a) 



270 280 
MEKAK AKGVEVV L P VDF II A 

290 300 
DAFSA S ANTKTVTD KE GI PA 

310 320 
GW Q G L D T GTE^EK L FAA TV A 



330 3^0 
KATV I L WNGP P GV FE FEK FA 

350 360 m 

AGTK ALLD E V VKSTAAGN S V 



370 380 
1 ' > 



I I GG GPTAT VAKKYGVTD KI 

i-Bg 

390 400 
SHVS TGGGASL 



b) 
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FIG. 12 



-226 
* 

AGCCTGCTCT CACACATCTT TCTTCTAACC AAGGGGTGTC XAGTTTAGTA 



176 

* 

GAACCTCGTG AAACTTACAT TTACATATAT ATAAACTTGC ATAAATTGGT 



126 
* 

CAATGCAAGA AATACATATT TGTCTTTTCT AATTCGTAGT TTTTCAAGTT 



76 
* 



1 



CTTAGATGCT TTCTTTTTCT CiUTTTT ACA GATCATCAAG AAGTAATTAT 



26 

* -ni 

CTACTTTTTA CAACAAATAT AAAACA ATG TCT TTA TCT TCA AAG TTG 

MET SER LEU SER SER LYS LEU 



• •• 
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-226 

* ' 

AGCCTGCTCT CACACaTCTT TCTTCTAACC AACCCCTCTT IAGTTTAGTA 



176 
* 

GAACCTCCTC AAACTTACAI TTACATATAT ATAAACTTGC ATAAATTCGT 



•126 

CAAIGCAACA AATACATATT TGTCTTTTCT AATTCGTACT TTITCAACTT 

FIG.13 

* 7 * 278 
CTTACAICCT TTCTTTTTCt CfTTTTtAC^ GATCATCAAC AACTAATTAI 

» 301 283 

CTACTTTTTA CAACAAATAT AAAAcU ATC TCT TTA TCT TCjfc AAA TIG TCT * 

1 MKT SEE LEO SEE SEA LIS LEO SEE 

1 • • • 5 

2 i 230 257 219 

CTC CAA GAT TTG ckc TTG AAG GAC |aAC CCT OTC JJCC ATC AGA CTT OAC TTC 
VAL GLH ASP LEO AS? LEO LY3 ASP LTS AEG VAL PHE XLE AKG VAL ASP PHE 

• 10 20 • • • • 

" 279 ZO 

AAC CTC OCA TTG GAG GOT AAG AAG | ATC | ACT TCT AAG CAA AGA ATT CTT CCT 
ASH VAL PEG LEO ASP GLT LTS LTS ILE THE SEE ASH GUI ABC ILB VAL ALA 

JO 40 • 

T 203 

CCT TTC CCA ACC ATC AAG TAC CTT TTG CAA CAC CAC CCA AGA TAC CTT CTC 
ALA LEO PEO THE ILE LTS TTE VAL LEU CLO HIS HIS PRO ABC TTE VAL VAL 
50 

* AgC I 

TTG CCT TCT CAC TTC CCT AGA CCA AAC CCT CAA AGA AAC CAA AAA TAC TCT 
LEU ALA SEE HIS LEO GLT AEG PEO ASK GLT CLO AEG ASH CLO LTS TTE SEE 

• 60 70 

228 

" 220 

TTG CCT OCA CTT CCT AAG CAA TTG CAA TCA TTG TTC CCT AAG GAT CTC ACC 
LEO ALA PIO VAL ALA LTS GUI LEO GUI SEE LEO LEO CLT LTS ASP VAL THR 

• * -80 90 • 

l J« 271 

TTC TTC AAC CAC TCT CTC CCT CCA CAA OTT CAA GCC |gCT CTC AAC CCT TCT 
PEE LEO ASH ASP CIS VAL GLT PRO CLO VAL CLO ALA ALA VAL LTS ALA SEE 

• • • • • • 100 

330 
* 

GCC CCA OCT TCC CTT ATT TTG TTG GAA AAC TTG CCT TAC CAC ATC CAA CAA 
ALA PEO CLT SEE VAL ILE LEO LEO CLO ASH LEO AEG TTE BIS ILE CLO GLO 

110 120 

381 
* 

GAA CCT TCC AGA AAG CTC CAT CCT CAA AAG CTC AAG* CCT TCC AAG GAA GAT 
CLO CLT SEA AEG LTS VAL ASP GLT GLH LTS VAL LTS ALA SEE LTS GLO ASP 

• -no ko - • • 

4 *S265 

GJTT CAA AAG TTC AGA CAC GAA TTG AGC TCT TTG CCT GAT CTT TAC ATC AAC 
VAL GLH LTS PHE AEG BIS GLO LEU SEE SEE LEO ALA ASP VAL TTE ILB ASH 
150 H50 

4 f 3 246 

CAT GCC TTC GOT ACC CCT CAC AGA OCT CAC TCT Tct ATC CTC CCT TTC CAC 
ASP ALA PHE GLT THE ALA HIS AEG ALA BIS SEE SEE HET VAL GLT PHE ASP 

170 

334 
* 

TTG CCA CAA CGT CCT GCC CCT TTC TTG TTG GAA AAG GAA TTC AAG TAC TTC 
LEO PEO GLH AEG ALA ALA GLT PHE LEO LEO CLO LTS GLO LED LTS TTE PHE 

• 180 190 • • 

S V *?» 

CCT AAC CCT TTG GAG AAC CCA ACC AGA CCA TTC TTC GCC] 
GLT LTS ALA LEO CLO ASH PRO THE AEG PEO PHE LEO ALA 

200 
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*EcoRI 

GAATTC GAA AAG TTC GCT GCT GGT ACT AAG GCT TTG TEA GAC GAA GTT CTC 
GLU LYS PHE ALA ALA GLY THE LYS ALA LEU LEU ASP GLD VAL VAL 

52 

AAG AGC TCT GCT GCT GGT AAC ACC CTC ATC ATT GGT GGT GGT GAC ACT GCC 
LYS SER SER ALA ALA GLY ASN THR VAL ILE HE GLY GLY GLY ASP THE ALA 

103 

* Bgt II 

ACT GTC GCT AAG AAG TAC GGT GTC ACT GAC AAG ATC TAG CAT GTC TCT ACT 
THR VAL ALA LYS LYS TYR GLY VAL THR ASP LYS ILE TYR HIS VAL SER THR 

154 

GGT GGT GGT GCT TCT TTG GAA TTA TTG GAA GGT AAG GAA TTG CCA GGT GTT 
GLY GLY GLY ALA SER LEU GLU LEU LEU GLU GLY LYS GLU LEU PRO GLY VAL 

205 

GCT TTC TTA TCC GAA AAG AAA TAA AT TGAATTGAAT TGAAATCGAT 

ALA PHE LEU SER GLU LYS LYS *** 

STOP 

251 

AGACGAATTT TTTTCTTTTC TCTTTCCCCA TCCTTTACGC TAAAATAATA 



301 
* 

GTTTATTTTA TTTTTTGAAT ATTTTTTATT TATATACGTA TATATAGACT 



3 l l FIG. U 

ATTATTTATC TTTAATGAT 
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EcoRI Hindlll 



FIG. 15 



EcoRI Hind III 



EcoRI 



Hind III 
EcoRI 




EcoRI 



PGK 5 



PstI 



PstI • 



Hindlll 
EcoRI 




Bglll 



PstI 



t 



EcoRI 1 Hind III 



EcoRI 



PGK 3 



\ — fN 



Sail Hindffl Bglll 



Hind III 
EcoRI 





PGK 5 



/ 



Bglll 



PstI 



EcoRI Hind ID 



EcoRI 



Hindlll 
EcoRI 




» 



PGK 5 



Bgl 0 (Expression 
^ . Site) 
PGK y 

Hind HI 



PstI 
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iBomHI 



CCATCC ATC CCC TCC AAC TCA ACC TCC TCT CTC CCC TCI CAT CTC CCT CAA 
MET OX CXS LTS SES 31ft CXS SSft TIL OI CSS AS? LBS FED GU 

52 
* 

ACCCACACCCTCCCrACCACCACSACC TTC ATC CTC CTC CCA CAC ATO ACS 
BIS SIX LKO CLX SEX AtG ASS THE LEO MB LSD LBO ALA GU MB AIC 



103 
• 

AAA ASC TCT CTT TTC TCC TCC TTC AAC CAC ACA CAT CAC TXT CCA TXT CCC 
LTSnjlSXftLBDrnSISCTSLK0t«A»JUftUASPrKBCLTISX7S^ 

154 
* 

CAS CAC CAC TTT CCC AAC CAC TTC CAA AAC GCT CAA ACC ATC OCX CTC CTC 
CLU CLD PHB CLT ASS CUI IBS CLE LTS ALA CLO THE US PSO TAL LSD 



SOS 
* 

CAT CAC ATC ATC CAC CAC ATC TTC AAT CTC TTC ACC ACA AAC CAC TCA TCT 
■UGLOMBZUCUCUrUPBASSLnmSnTBftLTSASVSSl 

256 



CCT TCC CAT CAC ACC CTC CCA CAC AAA TTC TAC ACT CAA CTC TAC CAC 
ALA ALA TIP ASP CUT TEE LCD LEO ASF LTS PBS TTS TXE CLO LCD TTS GLM 

307 



CAC CTC AAT CAC CTC CAA CCC TCT CTC ATA CAS CCC CTC 008 CTC ACA CAS 
LED AS* ASP LED CLO ALA CIS TAL XLS CLE CLT TAL CLT TAL THE CLO 



ACT CCC CTO ATC AAC CAC CAC TCC ATT CTO CCT CTC WG AAA TAC TTC CAA 
TEE ftO LEO MET LTS CLO ASP SSft tLS LED ALA f AL ABC LTS Tift ID CLM 

40* 

ASA ATC ACT CTC TAT CTC AAA CMS AAC AAA TAC ACC OCT TCT CCC TOO SAO 
ABC XLS TBS LSD TTS LSD LTS CLO LTS LTS TTS SSft PSO CTS ALA HP CLO 



4 
-» 



CTT CTC ACA CCA CAA ATC ATC ACA TCT 
TAL TAL ABC ALA CLO XLS MET ABO SSft 

SIX 



TTT TCT TTO TCA ACA AAC TTC CAA 



CAA AST TEA ASA ACT AAC CAA TCA 
CLD SEE LEO ABC SEE LTS CLO 



CCAAATCAT T 



5*0 

TTCAXTAATT 



TATCATCTOC CATTTCAAAC 



410 



CATCACACCA TTTAAATCTT TTTCAAAICT 



710 
a 

TTACACACCA CCAgPCTCAC TCATCCAZTA TCTATTTAAA TATTTTTAAA 

7« FIG. 16. 

ATATTATTTA TTTAACXATT CTTATTTTTQ TTCATAXTAC 



«10 



CTCATCTCCA CCTTTfl A T I CTCCTTAATC TAATAAAATA IPTTCTTXCt 



SCO 



AAAAJ 



IAA IAAAAAAAAA fcAAAAAAAAA 



910 

AAAAAAAAAA IAAAAAAAAA AAAAf COCAT CC 

Bam HI 
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